Sequence logos: a new way to display consensus sequences.

Authors

  • T D Schneider
  • R M Stephens
Abstract

A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.
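
The stacking rule described in the abstract can be sketched in a few lines of Python. This is only a minimal illustration, not the authors' implementation: the function name `logo_heights`, the four-letter DNA alphabet, and the omission of any small-sample correction are assumptions made here. For each aligned position it computes the information content in bits as the maximum possible entropy minus the observed entropy, scales each letter's height by its frequency, and orders the letters so the most common one ends up on top.

```python
import math
from collections import Counter


def logo_heights(aligned, alphabet="ACGT"):
    """Return, for each column, a list of (letter, height_in_bits), bottom to top.

    Hypothetical helper for illustration. Assumes equal-length sequences that
    contain only letters from `alphabet`; no small-sample correction is applied.
    """
    if not aligned or len({len(s) for s in aligned}) != 1:
        raise ValueError("need a non-empty list of equal-length sequences")
    if any(ch not in alphabet for s in aligned for ch in s):
        raise ValueError("sequences may only contain letters from the alphabet")

    max_bits = math.log2(len(alphabet))  # 2 bits for a 4-letter alphabet
    columns = []
    for column in zip(*aligned):
        counts = Counter(column)
        total = len(column)
        # Relative frequency of each letter that actually occurs in this column.
        freqs = {b: counts[b] / total for b in alphabet if counts[b] > 0}
        entropy = -sum(f * math.log2(f) for f in freqs.values())
        info = max_bits - entropy  # height of the whole stack, in bits
        # Letter height = frequency * stack height; sort ascending so the
        # most common letter is last, i.e. drawn on top of the stack.
        stack = sorted(((b, f * info) for b, f in freqs.items()),
                       key=lambda item: item[1])
        columns.append(stack)
    return columns


if __name__ == "__main__":
    sites = ["ACGT", "ACGA", "ACCA", "AGCA"]  # made-up example alignment
    for position, stack in enumerate(logo_heights(sites), start=1):
        print(position, [(b, round(h, 3)) for b, h in stack])
```

A fully conserved position reaches the full 2 bits, while a position split evenly between two bases carries 1 bit, so the stack heights alone show where the pattern is strongest.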

Similar resources

Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos

Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily nor do...

enoLOGOS: a versatile web tool for energy normalized sequence logos

enoLOGOS is a web-based tool that generates sequence logos from various input sources. Sequence logos have become a popular way to graphically represent DNA and amino acid sequence patterns from a set of aligned sequences. Each position of the alignment is represented by a column of stacked symbols with its total height reflecting the information content in this position. Currently, the availab...

A simulated annealing algorithm for finding consensus sequences

MOTIVATION A consensus sequence for a family of related sequences is, as the name suggests, a sequence that captures the features common to most members of the family. Consensus sequences are important in various DNA sequencing applications and are a convenient way to characterize a family of molecules. RESULTS This paper describes a new algorithm for finding a consensus sequence, using the p...

RNALogo: a new approach to display structural RNA alignment

Regulatory RNAs play essential roles in many essential biological processes, ranging from gene regulation to protein synthesis. This work presents a web-based tool, RNALogo, to create a new graphical representation of the patterns in a multiple RNA sequence alignment with a consensus structure. The RNALogo graph can indicate significant features within an RNA sequence alignment and its consensu...

On Base-Pairing Potential Between 16S rRNA and 5’ UTR in Archaebacterial Genomes

The Shine-Dalgarno (SD) sequence [4] of E. coli is known to be a signal to initiate translation. The widely accepted model is that the 3’ end of 16S rRNA base-pairs with the SD sequence in the first step of ribosome binding to mRNA. However, archaebacteria have been supposed to have systems of translation different from those of eubacteria and eucaryotes. Further, some eubacteria, such as M. ge...

Journal:
  • Nucleic Acids Research

Volume 18, Issue 20

Pages: -

Publication date: 1990